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1. Introduction 

In a neural network, the functional network structure should depend on the anatomical 
structure of the network, which reflects the spatial distribution of neurons [H [2]. 
Similarly to such biological systems, physical space as well as mathematical topology is 
relevant also for spatial networks of social systems. This is reflected by the old proverb 
saying that out of sight, out of mind, in accord with which people tend to make their 
relationships in their neighborhood. Even in the case of the internet network as a 
distinctive social network, which is in itself completely independent of the geographic 
world, its physical realizations such as routers and links are located in the geographic 
real world [3]. Obviously, the spatial distance discussed here is not limited to the 
Euchdean distance and general geographic measure can be defined appropriately on a 
case-by-case basis. Particularly for the subway system, which provides a good example 
of the transportation network, geographic space affects the structure of the system in 
a critical way. For the analysis of traffic flows on such a transportation network, it is 
appropriate to introduce weighted networks IHE]. 

So far, various approaches to spatial networks have been developed [6]. In 
particular, emergence of skew distributions like power-law distributions, possibly 
with cut-offs, or log-normal distributions was reported extensively in diverse systems 
including maritime transportation [7], London street networks |H], and passenger flows 
on subway networks in Seoul [9] and in London [10]. Such skew distributions in spatial 
networks should naturally emerge from the growth and evolution of elements of the 
systems. The Yule process or preferential attachment [TTj is well known to generate a 
power-law distribution whereas the cut-off behavior in the power-law distribution can 
be attributed to dependence on the distance [I2] or geo-political constraints [I3]. Also 
the emergence of various kinds of networks or distributions may be controlled by the 
limitation rule on the addition of new links [11] . These demonstrate that the structure 
of a system can be closely related to the time evolution of the system. Accordingly, 
examining the growth of a system, one may obtain crucial information on interesting 
characteristics of the system such as the size distribution. 

In this paper, we study the time evolution of the passenger flow on a subway 
transportation network. The number of passengers in a subway system evolves with 
the urban development. For example, if a new factory is built or a financial company 
is relocated in the city, the total number of passengers using the subway station near 
these facilities or taking trips between the subway stations near these facilities and those 
near the residential area of laborers should grow. The resulting time evolution of the 
number of passengers can be described conveniently by the master equation, which was 
proposed for growth [15]. Based on the master equation, where the transition rate is 
constructed from simple geographical consideration, we obtain the evolution equation 
for the distribution of subway passengers. It is disclosed that the equation admits skew 
distributions including the log-normal and Weibull distributions as well as the power- 
law one. We then consider the Metropolitan Seoul Subway (MSS) system, in which the 
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trip trajectory data are available for all passengers in a day. Analysis of the huge data, 
consisting of more than 10^ trips, indeed reveals skew distributions. The origin of such 
distributions together with their implications is discussed. 

This article consists of four sections: Section |2] presents the master equation 
approach to the growth and branching processes. In Sec. [3]the MSS system is considered 
and the passenger data are analyzed, disclosing skew distributions. Finally, Sec. H] 
discusses the obtained results and gives a summary. 



2. Master equation and skew distributions 

We consider the time evolution of the number of subway passengers x using a given 
station or taking trips between two stations. The passenger number x will change to 
x' when a factory or an apartment complex is built; this will be called an event. The 
amount of the change. Ax = x' — x is expected to depend on the present number x. 
Assuming the rich-get-richer principle, we take Ax to be proportional to x: Ax = bx 
with the growth factor b. The most simple form of the transition rate then reads 

u{x —J- x') = \6{x' — X — bx) (1) 

where A is the growth rate or the occurrence rate of the events affecting x. 

With the transition rate above, we now establish the master equation for the 
probability of the passenger numbers. Suppose that there are elements, which 
correspond to either A^ subway stations or A links between any pairs of subway 
stations. (Note that links are associated with passengers taking trips between origin 
and destination stations, and there is a link not only between two adjacent stations on 
one subway line but also between any two stations in the whole subway system.) The 
number of passengers of the ith element is denoted by Xj. We then follow Ref. [15] to 
write the master equation for the probability P(xi, X2, ■ ■ ■ , xa?; t) for the subway system 
in specific configuration {xi, X2, ■ ■ ■ ,Xn} at time t: 

— P(xi, ■ ■ ■ ,xn; = / dx^- [u{x'j -> Xj)P{xi, ■■■ , 4, ■ ■ ■ ,XN;t) 

j 

- u{Xj x'j)P{xi, ■■■ , Xj, • ■ ■ ,XN]t)] (2) 

where the transition rate u is given by Eq. ([1]). 

Here we introduce the distribution function f{x,t) of the passengers [U], which is 
related to the system configuration probability via 

f{x,t) = J dxi ■ ■ -dxAT ^5(xi - x)P(xi, ■ ■ ■ ,xn; t). (3) 



Equation ([2]) then takes the form 



/(x, t) =— / dxi ■ ■ ■ dxAfdx^ 6{xi — x)[6{{l+b)x[ — Xi)P(xi, ■ ■ ■ , x^, ■ ■ ■ , xn] t) 



(5(x- - (l+6)xi)P(xi, ■ ■ ■ , Xi, ■ ■ ■ , Xiv; t)], (4) 
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where the summation over the index j has been performed. Integrating over x'^ with 
care to the accurate normahzation, we arrive at the equation 

|/(.,<) = -A/(.,*) + ^/(^), (5) 

which describes the time evolution of the distribution function. 

It is straightforward to compute the stationary solution of Eq. ([5]). Setting 
f{x,t) = fs{x) and the left-hand side of Eq. equal to zero, we obtain 

fs{x)r^X-\ (6) 

Namely, Eq. ([5]) admits the trivial stationary solution, which is the power-law 
distribution with the exponent unity. 

To probe the evolving (time-dependent) solution of Eq. ([5]), we consider the 
passenger number x on the logarithmic scale. On the linear scale, the number is 
determined by multiplying appropriate times by a random variable X which takes the 
value 1 or (!+&) independently (and obviously has a finite second moment). Then on the 
logarithmic scale, it is given by adding independently or ln(l-|-6) appropriate times, 
which, according to the central limit theorem, results in a Gaussian distribution (for 
logarithms of x). This can be obtained directly from the time evolution equation by 
changing the variable from x to X = Inx. In terms of the corresponding distribution 
function F{X,t) = e"^/(e^,t), Eq. ([5]) simply reads 

^F(X, t) = -XF{X, t) + \F{X - a, t) (7) 

with a = ln(l + 6), which describes exactly the random walk and yields the Gaussian 
(normal) distribution as the asymptotic solution. This corresponds to a log-normal 
distribution for x as the solution of the original model described by Eq. (|5]). 
Indeed direct substitution confirms that Eq. ([5]) bears the log-normal distribution 
asymptotically, with time-dependent parameters. Specifically, inserting the log-normal 
distribution 

f{x,t) = ^^e-^^'-^-^^' (8) 
y iTxax 



in Eq. (|5]), we have 



1 
A 



\ — 5-(lna; - yU.) H — Amx - fi) 



exp<{ ^^[21na;-ln(l+6) -2/i] \ - 1. 



(9) 

Expanding the right-hand side with respect to a^^ gives the time evolution of the mean 
and deviation parameters /i and a of the log-normal distribution, obtained to the order 
of 

/i = aXt 

a = ay/Xt (10) 
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with a = ln(l+6). Note that At, product of the growth rate A and elapsed time t, 
corresponds to the growth frequency, i.e., the number of events occurred. It is also 
straightforward to generalize Eq. (|5]) for the system where there occur various types of 
event described by different values of A and b. For example, in the case of two types of 
event characterized by (Ai,6i) and (A2,62), the evolution equation reads 

= -A,/(,x.O + ^/ (^) - M + ^/ (^) . (11) 

The log-normal distribution in Eq. |8] again makes a solution, where the mean and the 
deviation are given by /i = (aiAi + a2\2)t and a = A/(afA7+~a|A2jt with ai = ln(l+6i) 
and 02 = ln(l+62)- Fitting the data at given time to Eq. (|8]), we can surmise the time 
evolution of the number of passengers and predict the distribution in the future. 

Heretofore the subway system has been considered as fixed. In reality, the number of 
stations or links in a subway system can change with time as construction of the system 
proceeds. Usually, a new subway station is added as a member of a newly constructed 
subway line, which connects, like most existing lines, downtowns and suburbs or business 
districts and residential areas. In consequence, the distribution of passengers due to 
newly constructed stations, i.e., passengers using new stations and links, is expected to 
be similar to that of existing stations and links. In case that the interactions between 
existing stations and new ones are negligible, Eq. (jS]) is also applicable even when new 
stations and links are included. It is, however, conceivable that increases in the numbers 
of passengers using new stations or links are related to the numbers of passengers of 
existing stations. Namely, additional stations and links in the subway system should 
induce redistribution of passengers as well as influx of new passengers (e.g., from newly 
accessible parts of the city). Such redistribution corresponds to the branching process, 
which can be described similarly to the growth process. In particular, the simple 
branching process including just one scale of branching like binary fission is described 
by the evolution equation in Eq. f lTTj) with 62 less than zero. (Note that a negative value 
of b implies a decrease of passengers.) Whereas the terms involving Ai and bi represent 
the growth as the terms in Eq. ([5]), those involving A2, related to the construction rate 
of new stations, and &2, reduction factor, correspond to the branching process. 

Equation (ITTl) admits the WeibuU distribution of the form 

/(x,t) = ^f^ye-(^/^)^ (12) 
X \ri J 

in addition to the log-normal distribution. Inserting Eq. f|T2|) and expanding as before 
Eq. OH]) indeed show that the Weibull distribution is an asymptotic solution, with the 
shape and scale parameters given by 

1 

a/ (afAi + a|A2)t' 

?7 = exp [(aiAi + a2A2)t]. (13) 
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While the log-normal distribution has bilateral symmetry after transformed to the 
normal distribution, the WeibuU distribution still has asymmetry, skewness leaned to 
the front and having a light tail, as the consequence of the branching process. 

To conclude, emergence of the log-normal or Weibull distribution depends on the 
manner of production of new elements, i.e., construction of new subway lines. In case 
that only existing elements grow independently by given growth factor with no elements 
created newly or that the weight and the strength distributions of new elements are 
similar to those of existing stations and links (corresponding to the self-size production), 
the size distribution of the system is expected to follow the log-normal distribution. 
On the other hand, if new elements are created from fission and the size of newly 
created elements is determined by the redistribution of the size of pre-existing elements 
(corresponding to the branching process), the Weibull distribution is expected to emerge. 
Detailed results for these production rules can be found in Ref. [T6] . 

Further, these processes can affect alternatively the growth of the system in a 
complementary manner. For example, one may consider the case that a portion of the 
passengers at a subway station consists of entirely new users of the subway system, i.e., 
incomers to the city or those who have used other means of transportation, and the other 
portion consists of passengers from other subway stations. The resultant distribution 
of weights or strengths will then lie somewhere in between the log-normal distribution 
and the Weibull one. 

3. Metropolitan Seoul Subway system 

In this section, equipped with the results of Sec. [21 we analyze the passenger data of the 
MSS system. We first define such terms as the weight, directed weight, and strength, 
following Ref. [9]: The weight w of a. link between a pair of subway stations is defined to 
be the total number of passengers taking trips between the two stations, flowing in both 
directions, whereas the directed weight is the number of passengers in one direction, i.e., 
from one station (origin) to the other (destination). The (total) strength s of a subway 
station is defined to be the total number of passengers using the station, so that the 
strength is equal to the sum of the weights of the links between the station and others. 
We also define the strength of departure and that of arrival as the numbers of passengers 
departing from and arriving at the station. Obviously, the sum of the departure and 
arrival strengths gives the total strength. 

There are 357 subway stations and 127,092 links between stations in the MSS 
system data used in this study, giving the data set to be handled: X = {xi, ■ ■ ■ 
with n = 357 for strengths and n = 127, 092 for weights. We deal with one-day data, 
which are tremendous in size, consisting of more then 10^ passenger trips (including 
bus trips). For convenience, we divide a day into three time zones: morning (4 am to 
10 am), afternoon (11 am to 4 pm), and evening (5 pm to 1 am), and separate the 
data accordingly, e.g., departure in the morning, arrival in the afternoon, etc. More 
specifically, 4,907,541 passengers depart from and arrive at stations in the whole day: 
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1,671,891 passengers depart in the morning, 1,128,482 passengers in the afternoon, and 
2,107,168 passengers in the evening. Similarly, 1,585,358 passengers arrive at stations 
in the morning, 1,082,196 passengers in the afternoon, and 2,239,987 passengers in the 
evening. Those arrivals and departures are counted independently, which results in that 
arrival and departure numbers in the strength data do not match with each other. This 
is due to the passengers who extend their trips beyond a single time zone, for example, 
getting on the subway in the morning and getting off in the afternoon. On the other 
hand, weights involve the whole duration of each trip, from the origin to the destination, 
and those passengers whose trips are extended beyond one time zone are excluded in 
the weight data. 

We first consider the spanning trees of passenger flows, where nodes represent 
subway stations, in the morning, afternoon and evening. The minimum spanning 
tree, which is widely employed to investigate the structure of a complex network, is 
constructed with weights not larger than those of other possible spanning tree. However, 
in the subway system, the links which have larger weights (passenger flows) are more 
important; those links make up the so-called maximum spanning tree (MST) [9]. Figure 
[1] presents the MSTs of passenger flows, built from the weight data in the morning, 
afternoon, and evening. The different MSTs in each time zone manifest that the hub 
structure changes with time in a day, which was not addressed in the previous study 
[9]. In the morning, large flows due to commuting passengers should occur between a 
number of (suburban) residential areas and a few (downtown) business districts; thus 
the latter appear as hubs, characterized by large degrees (numbers of connected links) 
in the MST. If the commuting pattern in the evening were the opposite to that in the 
morning, namely, if most passengers returned to their origins (residential areas) directly 
in the evening, the resulting MST would be the same as that in the morning, since the 
MST, constructed with (undirected) weights, does not discern the flow direction on each 
link. In reality, however, passengers tend to diffuse in the evening, rather than to return 
to their origins. In consequence the MST displays different structure, with more hubs 
usually of less degrees. 

Counting the degree k of each node (station) in the MST, we obtain the degree 
distribution P{k), which is shown in Fig. |2l We use the least-squares fitting, and 
observe the power-law behavior: P{k) ~ k'"^ with exponent 7 ~ 2.0, 1.8, and 1.7 for 
morning, afternoon, and evening flows, respectively. Note the slight variations of the 
exponent depending on the time zone. 

As suggested in Sec. [2l passenger data in the MSS system are fitted to the log- 
normal distribution and the WeibuU distribution. Both the least-squares fitting and the 
likelihood test method are used to obtain values of (/i, a) for log-normal distributions 
and (7, 77) for WeibuU distributions; then reduced ^^^^ log likelihood function In L 
are computed to recognize which distribution gives better fitting of the data of MSS 
system. Note that x^, given by the sum of the squared differences between the data 
points and the fitted distribution, is widely used to test justification of the fit. The 
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(a) Morning 




(b) Afternoon 




(c) Evening 



Figure 1. Maximum spanning trees of passenger flows in the Metropolitan Seoul 
Subway System. Names of some hub stations are provided for reference. 



Master equation approach to the subway passenger flow 



9 



Q. 



10': 


°^^y=2.0 


1 ' ' . 


10': 






10 : 


(a) morning 

1 ^ 


CNO - 


10' 


^^^^ 7=1 8 
: ^^^A 


1 1 M 


10' 

10° 


(b) afternoon 


A^NA 1 
A : 
A : 

\^A 

A^~A^A 

1 : 




h-H 1 ^ 

□ 




10' 






10' 






10° 


■(c) evening 

: . . 1 


□ -. 
1 ■ ■ ■ 




10° 


10' 



k 



Figure 2. Probability distribution of the degree k of the maximum spanning tree in 
Fig. [H plotted on the log-log scale. Lines serve as guides to the eye. 



likelihood function L is defined to be [T7] 

n 

L{a,(3) = l[f{xf,a,fi) (14) 
1=1 

where a and /3 are parameters of the distribution function f{xi;a,(3) to test, and Xj's 
are elements of the data set X. We first find the maximum likelihood estimators 
a and /3, which lead to the maximum value of the likelihood function. Among the 
tested distributions, the very distribution with the largest maximized likelihood value 
calculated from the maximum likelihood estimators is taken to be the best distribution 
describing the data. Validity of the likelihood test may be naively understood if the 
conservation normalization is proposed as the constraint [T7] . 

Here we remark that there exist links with zero weight although all stations have 
finite strengths. In general, links with zero weight are categorized into two groups: 
One includes links connecting stations which are very far from each other. In this case, 
passengers would rather take trips to the sub-center of the city closer to the origin. The 
other consists of links involving several transfers. Namely, transfer to a different subway 
line causes inconvenience and tends to be avoided; it is likely to use other transportation 
modes, e.g., cars and buses. These features do not change with the growth of the city 
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Table 1. Obtained values of chi square and log of the likelihood function 



or the evolution of the subway system. In addition, the number of passengers of an 
element, once it is zero, may not grow in our model, which allows one to exclude such 
links with zero weight in analyzing the data. 

Computed values of chi square ^ind log of the likelihood function In L are given 
in Table [H In most cases, the least-squares fitting and the likelihood test agree with 
each other on the fitted distribution. In case that they disagree, the difference in the 
likelihood value as well as in the value of log-normal and WeibuU distributions is 
small compared with the cases displaying agreed results. This indicates that data lie 
somewhere in between the log-normal distribution and the WeibuU one. The likelihood 
ratio test has been used to discriminate between log-normal and WeibuU distributions, 
in comparison with the chi-square goodness of the fit [181 IlHl EDI EH [22] . In those works 
the likelihood test method is in general favored. Therefore we present data with the 
likelihood test method rather than least-squares fitting. Typical data together with 
fitted distributions via various methods are exhibited in Figs. [3] to [71 the strength 
distribution f{s) in Figs. [3l[ll and[5lon the linear scale; the weight distribution f{w) in 
Figs. [6] and [7] on the logarithmic scale. 

In most cases, log-normal distributions describe the data better than WeibuU 
distributions, except for the total and departure strength data in the morning, arrival 
strength data in the evening, and all-day departure strength data. Whereas the 
strength data of morning departure fit better to the WeibuU distribution, other data lie 
somewhere in between the log-normal distribution and the WeibuU one. We therefore 
choose the log-normal distribution, and compute the growth factor b and the growth 
frequency \t from Eq. (fTOl) . which are presented in Table. [2 In general larger values 
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Figure 3. All-day arrival strength data, plotted in circles, together with the fitted 
log-normal distributions obtained by means of the likelihood method (red solid line) 
and least-squares fitting (green dotted line). Both methods give similar values of the 
parameters fi and cr within the error bar. 
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Figure 4. Strength data in various cases, plotted in circles, are shown to fit well 
to log-normal distributions. Both the log-normal distributions (red solid lines) and 
Weibull distributions (green dotted lines) have been obtained by mens of the likelihood 
method. 
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Figure 5. Strength data fitted somewhat better to WeibuU distributions. Both 
log-normal and WeibuU distributions have been obtained by means of the likelihood 
method. Symbols are the same as those in Fig. |4l 




10° 10^ 10^ 10^ 



w 

Figure 6. Data of the directed weights in the morning on the logarithmic scale, 
together with log-normal distributions obtained by means of the likelihood method 
and least-squares fitting. They give slightly different values of the parameters. In 
general, the likelihood test gives better results except for the region of small weights. 
Symbols are the same as those in Fig. [3l 

of b indicate that the elements (stations or hnks) have gone through larger changes in 
each event, while the total number of events occurred is given by At. 

It is evident that arrival in the morning and departure in the evening are mostly 
related to stations in downtown or business districts of the city. Conversely, arrival in 
the evening and departure in the morning are mainly associated with the residential 
areas and suburbs of the city. In Table |2] distributions of morning arrival and evening 
departure strengths exhibit relatively large values of b but smaller values of Xt; this 
indicate that the downtown areas have suffered fewer number of changes, each of which 
is, however, more drastic. On the contrary, from morning departure and evening arrival 
data, it is concluded that the suburbs and residential areas have experienced gradual 
changes with larger frequency. It is also of interest to note that afternoon data represent 
the distributions evolved most drastically. This may be attributed to subway stations 
close to such facilities as major transportation centers, university towns, and tourist 
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Figure 7. Various weight data on the logarithmic scale. All data fit well to log-normal 
distributions. Symbols are the same as those in Fig. 21 
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1.383 


1.747 






8.326 


8.122 


8.964 


1.895 


2.365 


Evening 




0.976 


1.064 


0.982 


1.450 


1.572 




b 


0.121 


0.149 


0.114 


2.034 


1.842 




Xt 


72.70 


58.31 


83.25 


1.708 


2.264 






9.100 


9.138 


9.815 


2.506 


3.046 


All day 


a 
b 


0.971 
0.109 


0.934 
0.100 


0.949 
0.096 


1.586 
1.727 


1.694 
1.565 




Xt 


87.74 


95.78 


106.93 


2.498 


3.234 



Table 2. Growth factor and growth frequency calculated from the mean and deviation 
parameters (/x, a) 
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spots, which attract more passengers in the afternoon. Indeed a few such stations, 
including particularly the station at the express bus terminal, are found to have rather 
large afternoon weights. 

In the case of the weight distribution, the log-normal distribution provides better 
fits at all times and for both directed and non-directed data. As expected, the 
afternoon data show most drastically evolved distributions, similarly to the strength 
distribution. On the other hand, for the strength distribution, although most cases 
also follow manifestly log-normal distributions, there are several cases displaying some 
characteristics of WeibuU distributions as well. Those cases lying in between log-normal 
and WeibuU distributions include strength distributions of morning departure, evening 
arrival, morning total, and all-day departure, among which the former two, morning 
departure and evening arrival, constitute the most conspicuous examples. 

4. Discussion 

We have proposed the master equation approach to the evolution of passengers in 
a subway system. With the transition rate constructed from simple geographical 
consideration, we have obtained the evolution equation for the distribution of subway 
passengers, which has been found to admit skew distributions including log-normal and 
WeibuU distributions. Then the Metropolitan Seoul Subway (MSS) system has been 
considered and the huge trip data of all passengers in a day have been analyzed by 
time zones, which reveals that the passenger trip data in most cases fit well to the log- 
normal distributions. In particular, it has been manifested that while the suburbs have 
been developed gradually and continually, the downtown areas have undergone rapid 
development intermittently. This reflects the trend that the evolution of the city depends 
mainly upon the development of a few downtown areas whereas the population is 
distributed to various suburbs and residential areas. Further, the afternoon data exhibit 
most drastic evolution, resulting mainly from subway stations close to such facilities as 
train stations, bus terminals, university towns, and tourist spots. Construction of such 
major facilities is thus expected to have a large effect on the passenger trip, and the 
possibility of swelling a number of passengers drastically should be considered. 

Unlike most cases displaying log-normal distributions, a few cases, particularly, the 
strength distributions of morning departure and evening arrival, follow more or less 
WeibuU distributions. According to the result of Sec. |2] that the branching process 
gives rise to the WeibuU distribution, we may understand emergence of the WeibuU 
distribution as follows: Presumably, construction of new subway lines will not affect 
much the passengers using the subway stations in the downtown areas. Accessibility to 
those areas, high enough, is already secured, and the establishment of additional subway 
stations may not affect immediately the movement to the downtown. Rather, an increase 
in passengers in the downtown areas and business districts of the city is largely related 
to the growth of the city itself and due to the influx from other regions or cities. The 
growth of the corresponding stations or links may then be regarded as independent of 
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each other. On the other hand, passenger distributions in the suburbs or residential 
areas are expected to exhibit different behavior. To begin with, construction of new 
subway stations should stimulate the movement between the catchment areas of the 
stations. For example, citizens having accessed to the MSS indirectly by means of other 
transportation means like shuttle buses to a pre-existing major transportation station, 
can now access to the MSS directly via new subway stations. These redistributions of 
the population and passengers bear resemblance to branching. To be specific, upon 
construction of a new subway station, the passengers using the pre-existing station are 
now divided into two parts: one continuing to use the pre-existing station and the 
other using the newly constructed station. It is usually more flexible for people to move 
residences than to move workplaces, which is reflected in the construction of new subway 
lines and stations. Accordingly, the distributions related to the residential areas, i.e., 
morning departure or evening arrival, have the origin, at least partially, in the branching 
process at least partially, and thus the characteristics of the Weibull distribution. 

The above discussion allows one to draw a conclusion on the growth of the city. 
The city first suffers rapid and drastic but infrequent growth as the business districts 
develop by absorbing populations from other regions. This is followed by gradual and 
continuous redistributions of the populations throughout the residential areas. The 
master equation approach in this paper makes clear these and helps one to understand 
the nature of the growth of Seoul City as well as the MSS. 

It is also of interest to notice the characteristic of the weight distribution depending 
on the time zone. While the afternoon data, like the strength distribution, show 
most drastically evolved distributions, the evening data show most gradually evolved 
distributions. Such behavior of the evening data reflects the trend that passengers 
diffuse more in the evening than in the morning, to other places such as entertainment 
districts or part-time workplaces [23]. This tendency is also reflected by the MST 
structure and the resulting exponent 7 of the degree distribution. The larger value of the 
exponent implies that passenger flows are concentrated on a few nodes (corresponding 
to downtown business districts). In contrast passenger flows diffusing to relatively many 
nodes result in less biased distributions and thus smaller values of the exponent. This 
explains the observation that the exponent takes a larger value in the morning (7 = 2.0) 
and a smaller value in the evening (7 = 1.7). 

The time evolution of the distribution of passengers may explicitly probed by 
analyzing time series data of the passengers in the subway system, which is left for further 
study. Despite the lack of such data sets in this work, we can still conclude that the 
suburbs have been undergone gradual development, i.e., small developmental changes 
rather frequently and the downtown areas have passed through rapid development, i.e., 
large changes once in a while. This reflects the fact that the evolution of a city is driven 
mainly by the development of a few downtown areas but the population is distributed 
in the suburbs and various residential areas of the city. 
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